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D i Abstract 

[L ■ 

We prove that for sufficiently large K , it is NP-hard to color A -colorable graphs 
with less than 2 K 1 colors. This improves the previous result of K versus if°( logK > 
in Khot [14]. 

u 

U: 1 Introduction 

In this paper we improve the inapproximability result for approximating the chromatic 
number of graphs. More specifically, given a if -colorable graph, we would like to color 
it with as few color as possible. This problem is closely related to a number of other 
problems such as approximation of independent sets and PCPs with low amortized free bit 
complexity. On the algorithmic side, Karger, Motwani and Sudan [12j give an algorithm 
' to color a if-colorable graph with 0{p}~ 3 '( K+1 ') colors, whereas Blum and Karger [3] 

\/-} , gave an algorithm to color a 3-colorable graph with 0(n 3 / 14 ) colors. There have been 

many works on the hardness side as well. It is known that coloring 3-colorable graph 
with 5 colors is NP-hard, and for general A"-colorable graph it is NP-hard to color with 
K + 2[y] colors [T3J|5]. For all sufficiently large if, the best known gap is by Khot [H] 
which proved that it is NP-hard to color a if -colorable graph with K °0°& K ) colors. This 
is equivalent to saying that it is NP-hard to distinguish a if -colorable graph from a graph 
that cannot be colored with fewer than K°^° gK ^ colors. Assuming a variant of Khot's 
2-to-l Conjecture, Dinur, Mosscl and Regev [7J proved that it is NP-hard to if'-color a 
, if -colorable graph for any 3 < if < K' . 

Khot's hardness result [Hj can be derived either using PCPs from Hastad and Khot 
[XT] or Samorodnitsky and Trevisan [17]. The PCPs is then combined with a random- 
ization technique to show the completeness of the reduction. We can view the results 
in [TT] and [T7] as showing approximation resistance for a family of Boolean predicates 
that has very few accepting inputs — it is NP-hard to approximate Max CSP with those 
predicates better than just doing random assignments. For each integer k > 0, the ap- 
proximation resistance predicates we get from [IT] and [17] has k variables (and thus 2 fc 
possible assignments) but only has 2°( v *^ accepting assignments. The predicate [TT] is 
approximation resistance even on satisfiable instances — or perfect completeness in PCP 
language — while the predicate from [17] is not. It is noted in [M] that having perfect 
completeness makes the reduction for coloring easier. 

In a recent breakthrough, Chan [1] proved approximation resistance for a family of 
predicates on k variables but only has k + 1 accepting assignments whenever k is of 
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the form k — 2 r — 1. Previously, approximation resistance of those predicates are only 
known assuming the Unique Games Conjecture [IS]- Hast [TD] proved that predicates on 
k variables having at most 2 [k/2\ + 1 (= k in the current setting) accepting inputs are 
not approximation resistant. This hardness also matches up to constant factor with the 
algorithm for Max CSP by Charikar, Makarychev and Makarychev [5]- 

It is an interesting question if this improved result can be used as in [Tl] to get better 
approximation gap for chromatic numbers. In the following theorem, we show that this 
is indeed the case. 

Theorem 1. For all sufficiently large K , it is NP-hard to color a K -colorable graph with 
2 K 1 colors. 

In [3], Chan also showed that for any K > 3, there is v — o(l) such that given a 
graph with an induced if-colorable subgraph of fractional size 1 — v, it is NP-hard to 
find an independent set of fractional size l/2^/ 2 + v . We refer to [51 USE] for additional 
discussions on Almost-Coloring. 

2 Label Cover and PCP 

In this section we review basics of Label Cover and PCPs and describe Chan's improved 
PCP construction. 

Let (U, V, E, L, R, II) be an instance of Label Cover, where R — dL for some constant 
d, the tuple (U, V, E) is a weighted bipartite graph, vertices in U are assigned labels from 
[L], and vertices in V are assigned labels from [R]. Each edge e = (u, v) is associated with 
d-to-1 mapping 7r e : [R] — > [L]. Given an assignment A : U — > [L] , V — > [R] , constraint on 
e is satisfied if ir e (A(v)) — A(u). The following theorem combines the celebrated PCP 
theorem [TJ [2] with Raz's parallel repetition theorem [TB] and shows hardness of Label 
Cover. 

Theorem 2. For any constant < a < 1, there are d, R < poly(l/cr) such that the 
problem of deciding a 3-SAT instance with n variables can be Karp-reduced in poly(n) 
time to the problem o/(l, a)-deciding a Label Cover instance of size n 1+ °( 1 \ Furthermore, 
L is a bi-regular bipartite graph with left- and right-degrees poly(l/(j). 

As is the case with many MaxCSP inapproximability results, the above Label Cover 
will be the starting point of our reduction. Typically, the reduction translates labelings 
for u 6 U and v G V to 2^1 and 2\ R \ Boolean variables, respectively. These variables 
are viewed as functions /" : {-1,1}I L I -> {-1,1} and g v : {-l,l} |i?l -> {-1,1}. We 
require that these functions are folded, that is, for any x S { — 1,1}' L ', y £ {— 1,1}' , 
f u (—x) — —f u (x) and g v {—y) = —g v {y). This corresponds to having negated literals in 
the CSP instances. In a correct proof for a satisfiable Label-Cover instance, we expect 
the functions to be long codes for the corresponding labelings of u and v, that is, setting 
f u (x) = x au{u) , and g°{y) = y av { v ). 

For an edge (u, v) in the Label-Cover, we sample queries 

according to some carefully chosen test distribution 7 '. The distribution T has the prop- 
erty that for any / 6 L and r € R such that irt UtV )(r) = I, the predicate P accepts 

(fuix^),--- ,f u (x^),gM m+1) ),--- ,9M m+1) )) 

with probability 1 (or 1 — e for some small constant e if we are considering non-perfect 
completeness) . We can think of this as a 2-player game, functions f u and g v are strategies 
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of each player, and the goal is to convince the verifier who uses predicate P to decide 
acceptance. 

In [3], Chan developed a new way of constructing efficient PCPs and proved that the 
following Hadamard predicate Hk ■ { — 1,1} K — > {0,1} is approximation resistant for 
K — 2 r — 1. The predicate Hk is on variables {xs}^sc[k], defined as 



This predicate has K+l accepting assignments. Samorodnitsky and Trevisan [18] showed 
that Hk is approximation resistance assuming the Unique Games Conjecture. Using his 
new technique, Chan proved that this is true assuming P ^ NP. 

The main idea in Chan's reduction is to consider a direct sum of K independent 
-fC-player games. In the i-th game, an edge ej is sampled as in the above 2-player game. 
Player i gets a uniform random string from { — 1, 1}I L I and all other players get samples 
from { — 1, 1}'^' as described below. Thus for each tuple of K — 1 vertices from V and 
1 vertex from U, the players will have a strategy which is a Boolean function taking 
as input (K — 1) strings of length R and 1 string of length L. In a correct proof, the 
strategies are expected to be products of long codes encoding the labeling of the vertices. 

We now formally define the PCP and how queries are sampled. 

Definition 3. Let (U,V, E, L, R,H) be a label cover instance. Define Vi — V 1 ^ 1 x U x 
V K ~ l for i £ [K]. For each v £ Vi, the proof contains function f v : f{— 1, l} fl V 1 x 
{ — 1, 1} L x ({ — 1, l} fl ) fe ' — 5- { — 1,1}. The verifier check the proof as follows: 

1. Sample independently K random edges e\ — (iti,Ui), • • • = (uk,vk) € E. 

2. Let Vi = («!,-•• , Vi-i, Ui, • • • ,Vk)- The verifiers queries {f Vi (qi)}f!Li7 where 
(\i is sampled as below. 

For i £ [K], first sample uniformly q^i e { — 1, 1} L . Let tt be the projection of edge 
ej. For each r £ R, pick a uniformly random point {x\, • ■ ■ , xk) from the subspace 
of Hk where Xi = qi i i, 7r (r); md set qjj.r = x j for j ^ i. Finally for each bit in q,^ 
i = 1, • • • , K , with probability r/ we resample the bit from the uniform distribution 
on {-1, 1}. 

3. Let f(q) = (f Vl ( qi ),... ,f VK (qjf)). Accept if H K (f{q)) = 1. 

In a correct proof, the function f v is the product of long codes encoding the labeling 
of each vertex in v. 

Theorem E.l along with Theorem A.l, 6.9 and C.2 of [4] shows completeness and 
soundness of the above reduction and we formulate as the following theorem. 

Theorem 4. Fix some small t],S > 0. Let the soundness of Label Cover a be chosen such 
that S — poly(K/rj) ■ <t s1< - 1 ' . Given a Label Cover instance LClal, we have the following: 

1. Lf LCl,cIL has value 1, the verifier accepts a correct proof with probability at least 



2. Lf the verifier accepts with probability greater than (K + l)/2 K + 25, then LCl,cIL 
has value at least a. 

3 Hardness of Approximating Chromatic Number 

In this section, we prove that for sufficiently large K, it is NP-hard to color a if-colorable 





otherwise. 
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graph with less than 2 
K 3 versus 2 K . 



colors. For convenience of notation, we in fact prove a gap of 
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The overall idea is similar to that in Khot [14] . We start by describing the FGLSS 
graph [5] of the PCP in Definition [3J The vertices in the FGLSS graph are function 
queries and corresponding accepting configurations, denoted as (f v ,q, x). The weight of 
the vertex is the probability that query (f v , q) is picked. The total weight of the graph 
is therefore K + 1. Two vertices are connected if they are clearly inconsistent (returning 
different answers for the same query to the same function). An independent set in the 
graph corresponds to a strategy / set of functions, and its weight is the acceptance 
probability of such strategy. For a PCP, the maximum weight independent set in its 
FGLSS graph is related to its acceptance probability. Note that if the maximum weight 
independent set has weight w, then we need at least (K + l)/w colors to color the whole 
graph since vertices having the same color must form an independent set. 

To use the FGLSS graph for coloring results, we also need to show that if a PCP 
has acceptance probability 1 — e, we can color the FGLSS graph with a small number 
of colors. As in [13], we modify Definition [3] so that the functions in the proof become 

f v : ({-Lip 2 ')*" 1 x {-1,1} L ' 2 * x ({-LI}*' 2 *) -> {-1,1}. Alternatively, we can 
think of this as modifying Label Cover by appending a i-bit binary string to all the labels 
and defining the new projection in the Label Cover instance as ir' e (r o a) = 7r e (r) o a for 
r £ fi and a £ {0, 1}*, where "o" denotes string concatenation. The value of this new 
Label Cover instance is exactly the same as the original setting. 

Consider the FGLSS graph of the PCP based on the above modified Label Cover 
problem. Soundness is straightforward. If the new proof makes the verifier accept with 
probability at least (K + l)/2 + 25, then the value of the new Label Cover is at least 
a and thus so is the original instance. 

Let a £ {0, 1}' be some global parameter. If a Label Cover instance has value 1, 
then there are at least 2* different ways of assigning labels satisfying all the edges. In 
a correct proof, functions would be product of long codes encoding labelings extended 
by a, denoted as f( Q ). Next we prove that independent sets corresponding to different 
a £ {0, 1}* covers almost all of the FGLSS graph of the modified PCP. We remove the 
small fraction of vertices that are not covered and this gives the completeness of the 
reduction. 

Formally, we follow Khot's notation and introduce the following definition character- 
izing whether we can cover certain vertex with independent sets. 

Definition 5. A set of queries q is good if for any labelings to the vertices and any 
accepting assignment z, there exists a global extension a, such that if the functions f ^ 
are products of long codes encoding the labelings extended by a, then f^ a )(q) = z. 

Let S — 2~ n ( K } be the soundness parameter of the PCP. By Theorem [3] of Chan, we 
require the soundness of Label Cover to be a — (5 /po\y(K /rj)) ^. 

Lemma 6. Let t be such that 2* = C ■ K 3 for some large constant C . For large enough 
K, at most a weighted fraction of exp(— 0{K)) of the queries is not good. 

Remove the vertices in the FGLSS graph that correspond to queries that are not 
good. The fraction of vertices removed is bounded by exp(— O(K)). In the soundness 
case coloring the FGLSS graph still needs at least K{1 - exp(-0(K)))/2-°^ = 2 n W 
colors. In the completeness case, each a £ {0,1}* is associated with an independent 
set consisting of vertices the form (f v ,q, z), where z = f( Q )(q) and f*™- 1 is a product 
of long codes encoding labelings of v extended by a. Consider any vertex (f v ,q, x) in 
the modified FGLSS graph. By definition q is good so there exists a £ {0,1}* such 
that f v (q) = f( Q °)(q) = x, so it is covered by the independent set associated with ao- 
Therefore the modified FGLSS graph can be colored with 2* = 0(K 3 ) colors. 
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Proof of Lemma\^ Let n = 0(1/ K 2 ). We thus have that the label size L = poly(l/<r) = 
exp(0(K)). 

Fix some labeling of the label cover instance and some accepting assignment z. 

Consider a G {0,1}*. Over the queries sampled, the probability that f( Q '(q) = z 
is 0(1/ K • (1 - n) I<2 ) = 0(e- r ' K2 /K) = 0(1/ K) — the probability of answering z on 
q before adding noise is 1/(K + 1), there are K functions, each being a product of K 
long codes, therefore the answer f( Q )(q) depends on K 2 bits, and if none of the K 2 bits 
are corrupted then the answer is correct. The contribution from other sources to answer 
z is negligible. Note that for different extension a, the bits that reads from q are 
completely different and therefore independent, so we have 



Pr 
q 



Va,f (Q) (q)^z = (1-O(l//0) 2 = exp(-0(2 t /K)). 



For query q, let Q(q) be the event that q is not good in the sense of Definition [5] 
there exists some labeling and some accepting assignment z, such that f( Q )(q) ^ z for 
any a. Taking union bound over all possible labelings and accepting configurations, we 
get that the weighted fraction of q that are bad is 



Pr[Q(q)] < L°^Keq?{-0(2*/K)) = exp(-0{K)). 
q 
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